Structural alphabet Structural alphabet: from a local point of view to a global description of protein 3D structures
نویسندگان
چکیده
The study of protein structures’ local conformations has a long history principally based on the analysis of the classical repetitive structures (i.e. α-helix and β-sheet), and also on the characterization of some particular structures in the coil state (e.g. turns). The secondary structures are interesting for describing the global protein fold but miss all the orientations of the connecting regions and so neglect many particularities of the coil state. In order to take these structural features into account, we have identified a local structural alphabet composed of 16 folding patterns of five consecutive residues, called Protein Blocks (PBs). Conversely to the secondary structures, the PBs are able to approximate every part of the protein structures. These PBs have been used both to describe precisely the 3D protein backbones with an average rmsd of 0.42 Å, and to perform a local structure prediction with a rate of correct prediction of 48.7%. In this chapter, we present the interest of the Protein Blocks by comparing the secondary structure assignment with the assignment in terms of PBs. We highlight the discrepancies between different secondary structure assignment methods and show some interesting correspondence between particular local folds and the Protein Blocks. Then, we use the Protein Block prediction to classify proteins into the classical structural classes, namely all α, all β and mixed. The prediction rate of theses different classes is good, i.e. 71.5%, with no confusion between all α and all β classes. Finally, we present a new approach named TopKAPi that stands for “Triangular Kohonen Map for Analyzing Proteins”. It enables to classify and analyze proteins H A L athor m anscript inerm -004564, version 1
منابع مشابه
Discretization of 3D protein conformations by learning fragments library and their short range dependence using a Hidden Markov Model
The aim of this study is to discretize protein three-dimensional (3D) conformation with an optimal accuracy. In a previous paper, overlapping 4-peptide fragments describing (3D) conformations of proteins were systematically classified by a Hidden Markov Model (HMM) [2]. Using HMM allows moving the description of 3D structures from the sole geometric aspect towards a more explanatory description...
متن کاملiPARTS: an improved tool of pairwise alignment of RNA tertiary structures
iPARTS is an improved web server for aligning two RNA 3D structures based on a structural alphabet (SA)-based approach. In particular, we first derive a Ramachandran-like diagram of RNAs by plotting nucleotides on a 2D axis using their two pseudo-torsion angles eta and . Next, we apply the affinity propagation clustering algorithm to this eta- plot to obtain an SA of 23-nt conformations. We fin...
متن کاملSARSA: a web tool for structural alignment of RNA using a structural alphabet
SARSA is a web tool that can be used to align two or more RNA tertiary structures. The basic idea behind SARSA is that we use the vector quantization approach to derive a structural alphabet (SA) of 23 nucleotide conformations, via which we transform RNA 3D structures into 1D sequences of SA letters and then utilize classical sequence alignment methods to compare these 1D SA-encoded sequences a...
متن کاملProtein short loop prediction in terms of a structural alphabet
Loops connect regular secondary structures. In many instances, they are known to play crucial biological roles. To bypass the limitation of secondary structure description, we previously defined a structural alphabet composed of 16 structural prototypes, called Protein Blocks (PBs). It leads to an accurate description of every region of 3D protein backbones and has been used in local structure ...
متن کاملA Multi-strategy Approach to Protein Structural Alphabet Design
The search for structural similarity among proteins can provide valuable insights into their functional mechanisms and their functional relationships. Though the protein 1D sequence contains the information of protein folding, the performance of predicting the 3D-structure directly from the sequence is still limited. As the increase of available protein structures, we can now conduct more preci...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007